Basic Statistics

Raw Counts

Name Value
Rows 5,413
Columns 9
Discrete columns 7
Continuous columns 2
All missing columns 0
Missing observations 0
Complete Rows 5,413
Total observations 48,717
Memory allocation 16 Mb

Percentages

Data Structure

Missing Data Profile

## Warning: `aes_string()` was deprecated in ggplot2 3.0.0.
## ℹ Please use tidy evaluation idioms with `aes()`.
## ℹ See also `vignette("ggplot2-in-packages")` for more information.
## ℹ The deprecated feature was likely used in the DataExplorer package.
##   Please report the issue at <]8;;https://github.com/boxuancui/DataExplorer/issueshttps://github.com/boxuancui/DataExplorer/issues]8;;>.
## This warning is displayed once per session.
## Call ]8;;x-r-run:lifecycle::last_lifecycle_warnings()lifecycle::last_lifecycle_warnings()]8;; to see where this warning was generated.

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 3 columns ignored with more than 50 categories.
## NM_PRODUCAO: 5411 categories
## DS_PALAVRA_CHAVE: 5331 categories
## DS_RESUMO: 5362 categories

QQ Plot

Correlation Analysis

## 4 features with more than 20 categories ignored!
## NM_ENTIDADE_ENSINO: 30 categories
## NM_PRODUCAO: 5411 categories
## DS_PALAVRA_CHAVE: 5331 categories
## DS_RESUMO: 5362 categories

Principal Component Analysis

## 3 features with more than 50 categories ignored!
## NM_PRODUCAO: 5411 categories
## DS_PALAVRA_CHAVE: 5331 categories
## DS_RESUMO: 5362 categories